Viewpoint Paper: Auditing the Semantic Completeness of SNOMED CT Using Formal Concept Analysis

Authors

  • Guoqian Jiang
  • Christopher G. Chute
Abstract

Objective: This study sought to develop and evaluate an approach for auditing the semantic completeness of SNOMED CT contents using a formal concept analysis (FCA)-based model.

Design: We developed a model for formalizing the normal forms of SNOMED CT expressions using FCA. Anonymous nodes, identified through the analyses, were retrieved from the model for evaluation. Two quasi-Poisson regression models were developed: Model 1 tests whether anonymous nodes can evaluate the semantic completeness of SNOMED CT contents, and Model 2 tests whether such completeness differs between 2 clinical domains. The data were randomly sampled from all the contexts that could be formed in the 2 largest domains: Procedure and Clinical Finding. Case studies (n = 4) were performed on randomly selected anonymous node samples for validation.

Measurements: In Model 1, the outcome variable is the number of fully defined concepts within a context, and the explanatory variables are the number of lattice nodes and the number of anonymous nodes. In Model 2, the outcome variable is the number of anonymous nodes, and the explanatory variables are the number of lattice nodes and a binary category for domain (Procedure/Clinical Finding).

Results: A total of 5,450 contexts from the 2 domains were collected for analysis. The number of anonymous nodes had a significant negative correlation with the number of fully defined concepts within a context (p < 0.001). Further, the Clinical Finding domain had fewer anonymous nodes than the Procedure domain (p < 0.001). Case studies demonstrated that anonymous nodes are an effective index for auditing SNOMED CT.

Conclusion: The anonymous nodes retrieved from FCA-based analyses are a candidate proxy for the semantic completeness of SNOMED CT contents. The FCA-based approach can be useful for auditing the semantic completeness of SNOMED CT contents, or of any large ontology, within or across domains.
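The idea of an anonymous node can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that a context maps each concept to the set of attribute-value pairs in its definition, and that a lattice node is "anonymous" when its attribute set (intent) does not match the definition of any existing concept. The toy context, the attribute strings, and the function names are hypothetical and are not taken from SNOMED CT.

```python
def concept_intents(context):
    """Return the intent of every node in the concept lattice.

    The intents of a formal context are the full attribute set plus
    every intersection of the objects' attribute sets.
    """
    all_attrs = frozenset().union(*context.values())
    intents = {all_attrs}
    for attrs in context.values():
        row = frozenset(attrs)
        # Intersect the new row with every intent found so far.
        intents |= {row & intent for intent in intents}
    return intents


def anonymous_intents(context):
    """Return intents not matched by any object's own attribute set.

    Assumption: this is what "anonymous node" means here. The top and
    bottom of the lattice are included if they carry no object.
    """
    labelled = {frozenset(attrs) for attrs in context.values()}
    return concept_intents(context) - labelled


# Hypothetical toy context: concept name -> defining attribute-value pairs.
context = {
    "Appendectomy": {"method=excision", "site=appendix"},
    "Gastrectomy": {"method=excision", "site=stomach"},
    "Biopsy of stomach": {"method=biopsy", "site=stomach"},
}

if __name__ == "__main__":
    for intent in sorted(anonymous_intents(context), key=len):
        print(sorted(intent))
```

In this toy context the node with intent {"method=excision"} is anonymous: two concepts share that attribute, but no concept is defined by it alone, which is the kind of gap an auditor might review as a possibly missing parent concept.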

Similar resources

Auditing the Semantic Completeness of SNOMED CT Using Formal Concept Analysis

Design: We developed a model for formalizing the normal forms of SNOMED CT expressions using FCA. Anonymous nodes, identified through the analyses, were retrieved from the model for evaluation. Two quasi-Poisson regression models were developed to test whether anonymous nodes can evaluate the semantic completeness of SNOMED CT contents (Model 1), and for testing whether such completeness differs...

Large-Scale, Exhaustive Lattice-Based Structural Auditing of SNOMED CT

One criterion for the well-formedness of ontologies is that their hierarchical structure forms a lattice. Formal Concept Analysis (FCA) has been used as a technique for assessing the quality of ontologies, but is not scalable to large ontologies such as SNOMED CT (> 300k concepts). We developed a methodology called Lattice-based Structural Auditing (LaSA), for auditing biomedical ontologies, im...

A Comparative Study of the Evolution and Structure of the Systematized Nomenclature of Medicine (SNOMED) Systems in the United States, the United Kingdom, and Australia, 85-86

Background and Aim: Systematized Nomenclature of Medicine systems provide important support for electronic health records in the registration and retrieval of data. Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT) is the most comprehensive such language; it supports the consistency of data exchanged across health care providers and, ultimately, more effective health care. Material...

Semantic Tagging of Medical Narratives with Top Level Concepts from SNOMED CT Healthcare Data Standard

Medical narratives written by clinicians constitute critical information in the healthcare domain and are required to be correct with respect to contextual meaning. SNOMED CT (Systematized Nomenclature of Medicine - Clinical Terms) is a standardized reference terminology that consists of 390,023 concepts, each with a SNOMED CT code. This paper describes the extraction of SNOMED CT concepts from fr...

Integrating Semantic Medical Entity Relations for Disease Prediction Using SNOMED-CT Terminology

Methods: In this paper, we propose a novel context-enhanced disease prediction approach based on leveraging semantic and contextual medical entity relations. Patient signs and symptoms are first mapped to SNOMED-CT concepts, which compose a feature space for disease prediction. Our major contributions in this paper consist of expanding the feature space using semantic and contextual concept rel...

Journal title:
  • Journal of the American Medical Informatics Association : JAMIA

Volume 16, Issue 1

Pages: -

Publication date: 2009